[docs] Add Google-style docstrings for dspy/evaluate/metrics.py #8954

eramis73 · 2025-10-19T22:23:45Z

This PR adds Google-style docstrings to public metrics in dspy/evaluate/metrics.py.

Ensures correctness and clarity
Adds small usage examples
Passes pre-commit hooks

chenmoneygithub

Thanks for the PR!

chenmoneygithub · 2025-10-21T21:09:53Z

dspy/evaluate/metrics.py


 def EM(prediction, answers_list):  # noqa: N802
-    assert isinstance(answers_list, list)
+    """Return True if any reference exactly matches the prediction (after normalization).


The opening line should describe what this API is, instead of the API's behavior.

chenmoneygithub · 2025-10-21T21:10:31Z

dspy/evaluate/metrics.py

+            otherwise False.
+
+    Example:
+        >>> EM("The Eiffel Tower", ["Eiffel Tower", "Louvre"])


This doesn't render on mkdocs, let's use the block style, e.g.,:

my_code

chenmoneygithub · 2025-10-21T21:11:38Z

dspy/evaluate/metrics.py

+        >>> EM("paris", ["Paris"])
+        True
+    """
+    assert isinstance(answers_list, list)


let's don't mix fix with docstring changes. Actually this assert statement won't provide more information to users

chenmoneygithub · 2025-10-21T21:11:43Z

dspy/evaluate/metrics.py

+        >>> round(F1("Eiffel Tower is in Paris", ["Paris"]), 2)
+        0.33
+    """
+    assert isinstance(answers_list, list)


chenmoneygithub · 2025-10-21T21:12:12Z

dspy/evaluate/metrics.py

+        float: The highest HotpotQA-style F1 score in [0.0, 1.0].
+
+    Example:
+        >>> HotPotF1("yes", ["no"])


use block code

chenmoneygithub · 2025-10-21T21:12:54Z

dspy/evaluate/metrics.py


    def remove_articles(text):
-        return re.sub(r"\b(a|an|the)\b", " ", text)
+        return re.sub(r"\\b(a|an|the)\\b", " ", text)


do we need to change this?

chenmoneygithub · 2025-10-21T21:13:46Z

dspy/evaluate/metrics.py

 def answer_exact_match(example, pred, trace=None, frac=1.0):
+    """Example/Prediction evaluator for answer strings with EM/F1 thresholding.
+
+    If ``example.answer`` is a string, compare ``pred.answer`` against it.


nit: single backtick around variables: example.answer

chenmoneygithub · 2025-10-21T21:14:19Z

dspy/evaluate/metrics.py



 def answer_exact_match(example, pred, trace=None, frac=1.0):
+    """Example/Prediction evaluator for answer strings with EM/F1 thresholding.


This is too detailed for the open sentence

…k examples); revert non-doc changes

eramis73 · 2025-10-21T22:14:07Z

All feedback addressed (concise openings + mkdocs block examples)
Non-doc changes reverted
Ready for re-review. Thanks @chenmoneygithub!

chenmoneygithub

This is pretty good, thank you for the contribution, LGTM!

eramis73 · 2025-10-28T00:03:52Z

Thanks!

Ziems · 2025-10-28T15:45:31Z

This is so great I love these

eramis73 · 2025-10-28T21:52:47Z

Thanks!

commit 056d54e Author: Isaac Miller <[email protected]> Date: Wed Oct 29 17:23:09 2025 +0100 fix(MIPROv2): zero shot not taking .compile parameters into account before determining if the program was zero shot (stanfordnlp#8909) * fix(MIPROv2): zero shot not taking .compile parameters into account before determining if the program was zero shot * remove extra logs * Remove log * Fix merge conflict * Remove extra whitespace commit da69f9d Author: TomuHirata <[email protected]> Date: Wed Oct 29 13:23:34 2025 +0900 Update anthropic model name (stanfordnlp#8992) Signed-off-by: TomuHirata <[email protected]> commit aaadf05 Author: Chen Qian <[email protected]> Date: Tue Oct 28 12:21:55 2025 -0700 lints (stanfordnlp#8987) commit e842ba1 Author: eramis73 <[email protected]> Date: Tue Oct 28 02:40:34 2025 +0300 [docs] Add Google-style docstrings for dspy/evaluate/metrics.py (stanfordnlp#8954) * docs(metrics): add Google-style docstrings for public metrics * docs(metrics): address review feedback (concise openings, mkdocs block examples); revert non-doc changes * fixes --------- Co-authored-by: chenmoneygithub <[email protected]> commit 6c43880 Author: TomuHirata <[email protected]> Date: Tue Oct 28 07:21:06 2025 +0900 Cache Ollama to speed up CI (stanfordnlp#8972) * Cache Ollama to speed up CI * fix permission commit 462baef Author: Copilot <[email protected]> Date: Mon Oct 27 11:57:27 2025 -0700 Fix TypeError when tracking usage with Anthropic models returning Pydantic objects (stanfordnlp#8978) * Initial plan * Fix TypeError when merging Anthropic CacheCreation objects in usage tracker Co-authored-by: TomeHirata <[email protected]> * Enhance _flatten_usage_entry to convert Pydantic models on first add Co-authored-by: TomeHirata <[email protected]> * Fix potential TypeError when both usage entries are None Co-authored-by: TomeHirata <[email protected]> * simplify * small fix * lint * robust version handling --------- Co-authored-by: copilot-swe-agent[bot] <[email protected]> Co-authored-by: TomeHirata <[email protected]> Co-authored-by: chenmoneygithub <[email protected]> commit 9b467b5 Author: Noah Ziems <[email protected]> Date: Mon Oct 27 13:32:07 2025 -0400 Add Disable Fallback Option in ChatAdapter (stanfordnlp#8984) commit bf022c7 Author: Lakshya A Agrawal <[email protected]> Date: Sat Oct 25 23:37:42 2025 +0530 Update gepa[dspy] dependency version to 0.0.18 (stanfordnlp#8969) * Update gepa[dspy] dependency version to 0.0.18 * Update pyproject.toml * fix test --------- Co-authored-by: TomuHirata <[email protected]>

commit 31b96af Author: Dushmanta <[email protected]> Date: Thu Oct 30 13:52:40 2025 +0530 fix: broken PyPI downloads badge from pepy.tech in README and docs home page (stanfordnlp#8995) * fix: update broken pypi download badge in readme * fix: update broken pypi download badge in docs home page commit 056d54e Author: Isaac Miller <[email protected]> Date: Wed Oct 29 17:23:09 2025 +0100 fix(MIPROv2): zero shot not taking .compile parameters into account before determining if the program was zero shot (stanfordnlp#8909) * fix(MIPROv2): zero shot not taking .compile parameters into account before determining if the program was zero shot * remove extra logs * Remove log * Fix merge conflict * Remove extra whitespace commit da69f9d Author: TomuHirata <[email protected]> Date: Wed Oct 29 13:23:34 2025 +0900 Update anthropic model name (stanfordnlp#8992) Signed-off-by: TomuHirata <[email protected]> commit aaadf05 Author: Chen Qian <[email protected]> Date: Tue Oct 28 12:21:55 2025 -0700 lints (stanfordnlp#8987) commit e842ba1 Author: eramis73 <[email protected]> Date: Tue Oct 28 02:40:34 2025 +0300 [docs] Add Google-style docstrings for dspy/evaluate/metrics.py (stanfordnlp#8954) * docs(metrics): add Google-style docstrings for public metrics * docs(metrics): address review feedback (concise openings, mkdocs block examples); revert non-doc changes * fixes --------- Co-authored-by: chenmoneygithub <[email protected]> commit 6c43880 Author: TomuHirata <[email protected]> Date: Tue Oct 28 07:21:06 2025 +0900 Cache Ollama to speed up CI (stanfordnlp#8972) * Cache Ollama to speed up CI * fix permission commit 462baef Author: Copilot <[email protected]> Date: Mon Oct 27 11:57:27 2025 -0700 Fix TypeError when tracking usage with Anthropic models returning Pydantic objects (stanfordnlp#8978) * Initial plan * Fix TypeError when merging Anthropic CacheCreation objects in usage tracker Co-authored-by: TomeHirata <[email protected]> * Enhance _flatten_usage_entry to convert Pydantic models on first add Co-authored-by: TomeHirata <[email protected]> * Fix potential TypeError when both usage entries are None Co-authored-by: TomeHirata <[email protected]> * simplify * small fix * lint * robust version handling --------- Co-authored-by: copilot-swe-agent[bot] <[email protected]> Co-authored-by: TomeHirata <[email protected]> Co-authored-by: chenmoneygithub <[email protected]> commit 9b467b5 Author: Noah Ziems <[email protected]> Date: Mon Oct 27 13:32:07 2025 -0400 Add Disable Fallback Option in ChatAdapter (stanfordnlp#8984) commit bf022c7 Author: Lakshya A Agrawal <[email protected]> Date: Sat Oct 25 23:37:42 2025 +0530 Update gepa[dspy] dependency version to 0.0.18 (stanfordnlp#8969) * Update gepa[dspy] dependency version to 0.0.18 * Update pyproject.toml * fix test --------- Co-authored-by: TomuHirata <[email protected]>

docs(metrics): add Google-style docstrings for public metrics

0140a95

chenmoneygithub self-requested a review October 20, 2025 17:32

chenmoneygithub reviewed Oct 21, 2025

View reviewed changes

docs(metrics): address review feedback (concise openings, mkdocs bloc…

125fbbb

…k examples); revert non-doc changes

fixes

7135240

chenmoneygithub approved these changes Oct 27, 2025

View reviewed changes

chenmoneygithub merged commit e842ba1 into stanfordnlp:main Oct 27, 2025
10 checks passed



		def answer_exact_match(example, pred, trace=None, frac=1.0):
		"""Example/Prediction evaluator for answer strings with EM/F1 thresholding.

[docs] Add Google-style docstrings for dspy/evaluate/metrics.py #8954

[docs] Add Google-style docstrings for dspy/evaluate/metrics.py #8954

Conversation

eramis73 commented Oct 19, 2025

Uh oh!

chenmoneygithub left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

eramis73 commented Oct 21, 2025

Uh oh!

chenmoneygithub left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

eramis73 commented Oct 28, 2025

Uh oh!

Ziems commented Oct 28, 2025

Uh oh!

eramis73 commented Oct 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants